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ABSTRACT 

Non-Watson-Crick pairs like the G-U wobble are frequent in RNA duplexes. Their geometric dissimilarity (nonisostericity) with 
the Watson-Crick base pairs and among themselves imparts structural variations decisive for biological functions. Through a 
novel circular representation of base pairs, a simple and general metric scheme for quantification of base-pair nonisostericity, 
in terms of residual twist and radial difference that can also envisage its mechanistic effect, is proposed. The scheme is 
exemplified by G-U and U-G wobble pairs, and their predicable local effects on helical twist angle are validated by MD 
simulations. New insights into a possible rationale for contextual occurrence of G-U and other non-WC pairs, as well as the 
influence of a G-U pair on other non-Watson-Crick pair neighborhood and RNA-protein interactions are obtained from 
analysis of crystal structure data. A few instances of RNA-protein interactions along the major groove are documented in 
addition to the well-recognized interaction of the G-U pair along the minor groove. The nonisostericity-mediated influence of 
wobble pairs for facilitating helical packing through long-range interactions in ribosomal RNAs is also reviewed. 
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INTRODUCTION 

Watson-Crick (WC) base pairs have dominated the double 
helical landscape ever since the elucidation of the structure 
of DNA. Although the four bases with their characteristic dis- 
tribution of hydrogen bond donors and acceptors offer a va- 
riety of base-pairing interactions, WC pairs are singularly 
inimitable, due to their implicit geometrical equivalence 
and self-isostericity, conferring a unique and distinct ability 
of interchangeability of paired bases quintessential for uni- 
form duplex. Unlike DNA, RNA duplexes are not composed 
entirely of WC base pairs. Base pairs other than the WC pre- 
vail in the RNA world. 

Among the many non-WC (nWC) pairs that occur in 
RNA, the G-U wobbles have attracted greater attention 
because of their abundance, next only to WC pairs, and 
due to their biological implications (Strobel and Cech 
1995; Hermann and Westhof 1999; Masquida and Westhof 
2000; Varani and McClain 2000). The G-U pair was one of 
the first nWC pairs proposed by Crick to account for the 
degeneracy of the genetic code (Crick 1966). Structural inves- 
tigations of yeast tRNA phe (Ladner et al. 1975; Quigley et al. 
1975; Stout et al. 1976) revealed a G-U wobble pair held to- 
gether by two hydrogen bonds. The characteristic hydrogen 
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bonding pattern renders the glycosidic bond angles at G 
(A G = 40°) and at U (Au = 65°) to be unequal, making A. dis- 
tinctive from those seen for WC base pairs. This forms the 
origin of its nonisostericity with WC pairs and non-self-iso- 
stericity within the G-U pair. Such and other types of subtle 
but distinguishing geometric variations among nWC pairs 
contribute to their nonisostericity and limit their replace- 
ment, either with WC pairs or with other nWC pairs, espe- 
cially in a WC pair-dominated duplex, without entailing 
deformations of RNA duplex. Presence of such nonisosteric 
base interruptions in a WC duplex may also result in unique 
structural traits for nWC pairs to serve as recognition 
elements. 

Diversity of nWC base pairs and their significance began to 
emerge with the elucidation of structures of RNA systems. 
Every new RNA structure offered insights into the effects of 
nWC pairs on structural, functional, and thermodynamic 
stability of the WC pair dominant duplex. Efforts to systemati- 
cally characterize different nWC base pairs have resulted in 
various classification and nomenclature schemes (Saenger 
1984; Leontis and Westhof 2001; Nagaswamy et al. 2002; 
Lee and Gutell 2004) . However, precise understanding of their 
possible mechanistic influence warrants quantification of 
their unique geometric properties particularly in comparison 
with WC pairs. We propose here a simple metric scheme to 
characterize nonisostericity between any two base pairs in ge- 
neral. It has the intrinsic merit to anticipate the effects of base- 
pair nonisostericity. We demonstrate this by identifying an 
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innate residual twist as the critical geometrical feature that ren- 
ders G-U and U-G pairs to be nonisosteric with WC pairs and 
further predict its influence in terms of local under- or over- 
winding in a RNA duplex, in striking accord with experimen- 
tal observations. 

CIRCULAR REPRESENTATION TO CHARACTERIZE 
ISOSTERICITY AND NONISOSTERICITY 
OF BASE PAIRS 

The isosteric or nonisosteric nature between any two base 
pairs can be described by overlaying base pairs encompassed 
by circles formed with CI' ... CI' atoms as the diameter, since 
physical context of isostericity/nonisostericity becomes 
readily apparent through such circular representations. 
Superposition of A.U, U.A, G.C, and C.G WC pair circles 
(Fig. 1A) results in a near-perfect alignment, especially the 
line joining their CI' ... CI' atoms. Interchange of U and A 
within A.U and G and C within a G.C pair retains isosteric fea- 
tures, rendering them to be also self-isosteric. In sharp con- 
trast, overlaying a G-U pair on a G.C base pair (a 
representative of a WC pair) reveals that the line joining the 
CI' ... CI' atoms of the G-U pair does not align with the line 
joining the CI' ... CI' atoms of the G.C pair but subtends a 
counter-clockwise twist of + 14° (Fig. IB). We define this in- 
nate twist as the residual twist (Af°) since it appears even prior 
to effecting the prescribed twist (f = 33° for an ideal ARNA) re- 
quired to generate a helical structure. Thus, the G-U pair is 



nonisosteric with the WC pair, and the degree of nonisoster- 
icity is equivalent to At On the contrary, when a U-G base 
pair follows a WC pair, it subtends a clockwise twist, defining 
a negative residual twist of At « — ll°(Fig. 1C). This difference 
in the magnitude as well as direction of Af compared to the 
G-U pair (Fig. IB) is because of the fact that G-U and U-G 
base pairs are not interchangeable (G-U / U-G), an attribute 
arising out of the nonequivalent glycosidic bond angles 
(\q # A.u). This lack of self-isostericity in the G-U and U-G 
wobble pair differentiates them from each other as well as 
from self-isosteric WC pairs. By the same arguments, super- 
positions of U-G and G-U pairs (5' U-G/G-U 3') and G-U 
andU-Gpairs (5' G-U/UG3') display strikingly large nonisos- 
tericity characterized by Af » +24.9° and Af » —24.9°, respec- 
tively (Fig. 1D,E). 

Further, the CI' ... CI' separations (diameters of the en- 
compassing circles) of G-U and WC pairs are not precisely 
identical but exhibit very small differences (Fig. IB). This is ex- 
pressed through the radial difference (Ar A). A radius higher 
than the preceding reference pair (G.C, in this case) is defined 
as positive (+Ar), and lower, as negative (—Af). It maybe read- 
ily surmised that positive and negative Ar might cause bulges 
or constrictions, respectively, in a duplex. Incidentally, the ra- 
dial difference between G-U (U-G) and WC pairs is negligibly 
small (0.05 A) and is, therefore, expected to exert little influ- 
ence. Seemingly then, the magnitude and the sign of Af are ad- 
equate to describe nonisostericity between Crick's wobble and 
WC pairs. 




FIGURE 1. Superposition of base pairs in their circular representations. (A) Self-isosteric WC 
pairs (G.C, C.G, A.U, U.A), (B) G-U pair on the 3' side of a G.C pair (a representative of a WC 
pair), (C) U-G pair on the 3' side of a G.C pair, (D) G-U pair on the 3' side of a U-G pair, and 
(£) U-G pair on the 3' side of a G-U pair. Each WC and nWC base pair is encompassed by a circle 
with the respective CI' ... CI' separation as the diameter. CI' atoms are shown by filled circles and 
hydrogen bonds by dotted lines. A representative WC pair (G.C) is colored green, while G-U and 
U-G are colored brown. Note the overwinding and underwinding effect that can be readily inferred 
by the counter-clockwise and clockwise movement of the red circles indicated by arrows. 



MECHANISTIC INFLUENCE OF 
NONISOSTERICITY (AO 

It does not entail much reason to foretell 
from above that one of the mechanistic ef- 
fects of residual twist between G-U and 
WC pairs would be in the form of influ- 
encing the local helical twists (Fig. 2). 
Accordingly, a G-U wobble pair succeed- 
ing a WC pair in a RNA duplex is expected 
to experience an effective helical twist of f 
+ Af (33° + 14.0° « 47.0°), while that pre- 
ceding a WC pair experiences a helical 
twist of f - Af (33° - 14° « 19°) (Table 1). 

On the other hand, the influence of a 
U-G pair would be opposite to that of a 
G-U pair (Table 1; Fig. 1C) in view of the 
non-self-isosteric nature of G-U and U-G 
pairs resulting in their opposite effect on 
the direction of Af. Thus, a U-G pair suc- 
ceeding a WC pair would experience an 
effective helical twist of f-Af (33°- 
11 0 »22°), while the same preceding a 
WC pair would experience an effective 
twist of f+Af (33°+ 11° « 44°) in a 
RNA duplex. Therefore, a U-G or G-U 
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FIGURE 2. Illustration of the mechanistic effects of base-pair nonisos- 
tericity (residual twist) between WC and Crick's wobble base pairs. A 
G-U wobble pair (gray) flanked on either side by G.C base pairs (black) 
when the helical twist t = 0°. Note the complete overlap of guanines and 
protrusion of uracil of the intervening G-U pair toward the major 
groove edge. The central G-U wobble subtends a counter-clockwise 
and clockwise twist with respect to the preceding and succeeding cyto- 
sine (along the direction of helical propagation), depicted by arrows. 
This results in a positive (Ar= +14°) and negative (At = —14°) residual 
twist manifesting as a local helical overwinding (33° + 14° = 47°) and lo- 
cal helical underwinding (33° - 14° = 19°) at the 5' G.C/G-U 3' and 5' 
G-U/G.C 3' steps, respectively, in the presence of a helical twist angle 
of 33°. 



wobble pair would induce a mechanistic influence in the form 
of local underwinding or overwinding. Most significantly, the 
negative and positive values of At directly correlate with 
underwinding and overwinding of the helical twist, respec- 
tively, enabling a straightforward understanding of the effect 
of base-pair nonisostericity. 

VALIDATION OF PREDICTED MECHANISTIC 
EFFECTS OF At 

In order to verify the predicted mechanistic effects of At, mo- 
lecular dynamics (MD) simulations (20 nsec using AMBER 
11.0) (Case et al. 2010) of RNA duplexes comprising G-U 
and U-G wobble pairs under different sequence contexts 
(Scheme A) have been performed. Excellent agreement (Fig. 
3) between the predicted helical twist angles based on the con- 
cept of residual twist, MD simulations, and crystal structure 
data supports the usefulness of the present scheme of defining 
nonisostericity between base pairs. 

MD simulation and X-ray crystal structure data suggest an 
average helical twist angle of 39.4° and 36.2° (overwinding), 
respectively, at the WC/G-U base step, and 24° and 28.2° 
(underwinding), respectively, at the G-U/WC base step 
(Scheme Al; Fig. 3A). This trend nearly reverses with the av- 
erage twist angles of 26.4° and 28.7° (underwinding) at the 
WC/U-G base step, and 39.6° and 37.4° (overwinding) at the 
U-G/WC base step (Scheme A2; Fig. 3B). In conformity with 
the above, a trend of underwinding (MD = 25.8°, X-ray = 
27.9°), overwinding (MD = 39.8°, X-ray = 37.1°), and under- 
winding (MD = 25.6°, X-ray = 28.2°) is observed (Fig. 3C) at 
the GU, UG, and GC steps, respectively, when a UG dinucleo- 



tide intervenes in a WC paired duplex (Scheme A3). Exactly 
the reverse pattern occurs at the GG, GU, and UC base steps 
(Fig. 3D), exhibiting overwinding (MD = 39.3°, X-ray = 
36.4°), underwinding (MD = 25.4°, X-ray = 27.8°), and over- 
winding (MD = 38.8°, X-ray =36.9°), respectively, in a GU 
dinucleotide interrupt (Scheme A4). 

Due to the large residual twist (At = 24.9°) between the 
nonisosteric G-U and U-G pairs, a high twist (MD = 39.8°, 
X-ray =37.1°) and a low twist (MD = 25.4°, X-ray = 27.8°) 
value persists at the tandem UG (Scheme A3 ) and GU steps 
(Scheme A4), respectively. Despite very high (Af=+24.9°) 
and very low (At = —24.9°) residual twist at the UG and GU 
base steps, predicted extreme overwinding and underwinding 
are moderated and absorbed by local changes in the sugar- 
phosphate backbone (see Discussion). As expected, the over- 
winding (MD = 39.4°, X-ray =37.5°) and underwinding 
(MD = 24°, X-ray = 26.9°) feature is observed (Fig. 3E) at 
the WC/G-U and G-U/WC base steps (Scheme A5). The heli- 
cal twist at the GG base step remains unchanged, with an av- 
erage value (MD = 30.8°, X-ray = 33.1°), as Af is 0° between 
them. 

STRUCTURAL CONTEXT OF G U AND U G 
WOBBLE PAIRS AMONG WC PAIRS 

In order to understand the role of wobble base pairs in influ- 
encing the immediate helical neighborhood, we systematically 
examined the occurrence of G-U and U-G wobble pairs and 
their structural context by analyzing high-resolution crystal 
structure data. One of the striking features of base-pair noni- 
sostericity-mediated structural effects is the differential base- 
stacking interaction of wobble G-U with flanking base pairs. 
A wobble G-U pair stacks better when it precedes a WC pair 
than when it succeeds it (Mizuno and Sundaralingam 1978). 
This is a direct consequence of the opposite effects of At, 
wherein underwinding at the 5' G-U/WC 3' step due to 
t — At results in better stacking as compared to a poor stacking 
interaction at the 5' WC/G-U 3' step due to overwinding me- 
diated by t + At. Thus, it is more likely that a wobble G-U pair 
occurs at the beginning of the helical stem (Table 2A) rather 
than at the end of the helix (Table 2B). Exactly opposite effects 
are anticipated for U-G wobble pairs due to reversal in the sign 
and magnitude of At Hence, loops closed by G-U pairs (stems 
beginning with G-U pairs) are more stable than those closed by 
U-G pairs (stems beginning with UG pairs) (Serraetal. 1993). 
These factors may explain the high occurrence of G-U and U-G 



Scheme Al: 5' GGGG G CCCC 

CCCC H GGGG 5' 



Scheme A3: 5' GGGG UG CCCC 

CCCC GU GGGG 5' 



Scheme A5: 5' GGGG GG CCCC 
CCCC UU GGGG 5' 



Scheme A2: 5' CCCC U GGGG 

GGGG G CCCC 5' 



Scheme A4: 5' GGGG GU CCCC 

CCCC UG GGGG 5' 
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TABLE 1. Base-pair nonisostericity matrix for c/s-WC/WC base pairs 




Quantification of base pair nonisostericity in terms of residual twist (At, blue) and radial difference (Ar, red) between WC and nWC pairs and 
between different nWC base pairs. Base steps with high and low residual twist and radial difference are highlighted in light blue and light red, 
respectively. Subset of the matrix depicting nonisostericity measure between WC and wobble pairs is enclosed by thick lines. The matrix is 
symmetric along the principal diagonal with opposite signs. 



wobble pairs at the beginning and end of helical stems, respec- 
tively, as also seen in ribosomal secondary structures (van 
Knippenberg et al. 1990; Gautheret et al. 1995; Szymahski 
etal. 2000). 

A large number of G-U and U-G wobble pairs flanked by 
WC and nWC base pairs are also found within the helix 
(Table 3). These are predominantly flanked by WC base pairs 
(62.6%). Among those that precede a G-U pair, a large num- 
ber are purine-pyrimidine base pairs (31.1%), compared to 
the pyrimidine-purine type (18.8%). Although it is shown 
above that the +Af mediated- overwinding brings forth in- 
tra-strand destacking, the degree of destacking is much lower 
when a purine-pyrimidine base pair precedes a G-U wobble 
than a pyrimidine-purine base pair (Gautheret et al. 1995). 
This is consistent with the thermodynamic studies on the 
near-neighbor effects, which indicate that G-U pairs preceded 
by a G.C base pair are more stable (He et al. 1991), thus ac- 



counting for their large occurrence (23.0%) compared to A.U 
base pairs (8.1%). 

On the other hand, among WC pairs that succeed a G-U 
wobble, pyrimidine-purine pairs occur more frequently 
(28.9%) than purine-pyrimidine base pairs (21.0%). A C.G 
base pair succeeding a G-U wobble is found to be thermody- 
namically more stable (He et al. 1991) and hence occurs in 
high frequency (22.7%) than G.C (14.0%), A.U (7.0%), 
and U.A base pairs (6.1%). 

STRUCTURAL CONTEXT OF WOBBLE PAIRS 
AMONG nWC PAIRS 

Interestingly, nWC pairs flanking wobble G-U base pairs 
exhibit strong positional preferences (Table 3). A large num- 
ber of them (31.5%) are found either preceding G-U or 
succeeding U-G pairs. On the other hand, occurrence of 
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Base Step 

FIGURE 3. Comparison of helical twist angles in RNA duplexes interrupted by wobble pairs. Average values of helical twist angles at different base 
steps in RNA duplexes interrupted by (A) G-U pair (Scheme Al ), (B) UG pair (Scheme A2), (C) UG pair followed by G-U pair (Scheme A3), (D) G-U 
pair followed by U-G pair (Scheme A4), and (£) two G-U pairs (Scheme A5). Twist angles based on residual twist (At) (black), MD simulations (blue), 
and crystal structure data (red) are shown. Differences between the predicted and the observed helical twist angle at WC/wobble or wobble/WC steps 
may be due to compensatory adjustments in the sugar-phosphate backbone conformations to minimize the effect of residual twist. Numbers at the 
bottom right of each panel denote the number of instances in X-ray structures found and considered for comparison. 



nWC pairs succeeding G-U or preceding U-G are rather less 
(5.8%) (Table 3). Preference for a specific nWC base pair is 
dictated by its ability to provide better stacking with the wob- 
ble base pair. 

A-G pair preceding G-U 

Among the preceding nWC pairs, an A-G pair is more fre- 
quent (34.6%) followed by U-G and G-U base pairs, similar 
to the trend observed in ribosomal RNA secondary structures 



(Gautheret et al. 1995). A-G base pairs preceding a G-U wob- 
ble are mostly involved in a frans-Hoogsteen/Sugar Edge 
{trans-Hl 'SE) base-pairing scheme (Leontis et al. 2002), in- 
volving N7(A) ...N2(G), N6(A) ...N3(G), and N6(A) ... 
02' (G) hydrogen bonds. This scheme entails nonisostericity 
to the extent of Ar= +0.4 A and At = -30.9° (Table 4; Fig. 
4F), warranting local underwinding concomitant with en- 
hanced intra-strand base stacking with the wobble G-U pair 
(Fig. 5A). Incidentally trans-H/SE A-G pairs are the most sta- 
ble members of the £rans-H/SE base-pair family (Mladek 
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TABLE 2. Frequency of terminal wobble pairs 





Helical schemes 


Total number of 
instances 


A 


, y 

Mill* 

u \ 


13 


B 


\. r 
.Mill 


05 


C 


, y 

Mill* 


02 


D 


.Mill 


29 



G-U and U-G pairs are frequently found at the beginning and at 
the end of the helix, respectively. 



et al. 2009) and are observed in kink-turn motifs (Klein et al. 
2001), GNRA tetraloops (Jucker et al. 1996), sarcin/ricin 
loops (Szewczak et al. 1993; Correll et al. 1999), and E-loops 
(Wimberly et al. 1993). Thermodynamics of loop formation 
indicate loops with tandem trans- H/SE A-G pairs preceding a 
G-U wobble are more stable (Walter et al. 1994). However, 
G-A base pairs with the same hydrogen bonding scheme 



are rarely observed preceding a G-U wobble, occurring at 
just two sites (1.1%). It may be noted that this base-pairing 
scheme renders the G-A pair to be non-self-isosteric and sub- 
tends a high value of residual twist {At = +59.5°) and Ar = 
+0.4 A with the succeeding G-U pair (Fig. 4G), causing signif- 
icant overwinding and poor inter-strand base stacking (Fig. 
5B), thus accounting for their rare occurrence. 

U-G pair preceding G-U 

A U-G wobble is the second most frequently observed nWC 
pair preceding a G-U wobble (21.7%) and is also the most 
abundant among tandem wobble motifs (Gautheret et al. 
1995; Masquida and Westhof 2000). Despite a large At 
(+24.9°) with the succeeding G-U wobble (Fig. 4C), high oc- 
currences of this motif are attributed to its thermodynamic 
stability (Wu et al. 1995) due to strong inter-strand stacking 
(between the guanines) and —At (—11.0° and — 14.0°)-medi- 
ated enhanced intra-strand stacking with the flanking WC 
pairs (Biswas et al. 1997; Deng and Sundaralingam 2000). 

G-U pair preceding G*U 

Two successive G-U base pairs are the next most frequent- 
ly observed (18.9%) amid WC paired RNA duplex. 
Isostericity of base pairs (Af=0°) at the tandem interface 
maintains an average helical twist angle of 33°, causing inter- 
mediate base-stacking interactions (Masquida and Westhof 
2000). However, compared to the previous case, this motif 
exhibits diminished stacking with the flanking WC pairs ow- 
ing to a high (+14.0°) and low At (-14.0°) with the preceding 
and succeeding WC base pairs (Fig. 4E). Also, thermody- 
namic studies have shown that duplexes comprising succes- 
sive G-U pairs are less stable compared to those with the 
U-G/G-U motif (He et al. 1991) and, therefore, less frequently 
observed. MD simulations of miniduplexes comprising 



TABLE 3. Frequency and structural context of intra-hel 


cal wobble pairs 


and their flanking 


base pairs 






Frequency of fl 


anking WC 


Frequency of preceding nWC 


Frequency of succeeding nWC 


Total no. of occurrences of G-U base pairs 5' WC/G-U/WC 3' 


base pairs 5' nWC/G-lVWC 3' 


base pairs 5' WC/G-U/nWC 3' 


568(100%) 356(62.67%) 


179 (31.51%) 


33 (5.80%) 


GC/G-U/WC 


82 (23.03%) 


A-G/G-UAVC 


62 (34.63%) 


WC/G-U/A-G 


09 (27.27%) 


CG/G-U/WC 


44 (12.35%) 


U-G/G-U/WC 


39 (21.78%) 


WC/G-U/G-A 


09 (27.27%) 


AU/G-U/WC 


29 (08.14%) 


G-U/G-U/WC 


34 (1 8.99%) 


WC/G-U/U-G 


05 (15.15%) 


UA/G-U/WC 


23 (06.46%) 


A-C/G-UAA/C 


1 3 (07.26%) 


WC/G-U/A-C 


03 (09.09%) 


WC/G-U/CG 


81 (22.75%) 


C-U/G-UAVC 


12 (06.70%) 


WC/G-U/U-U 


03 (09.09%) 


WC/G-U/GC 


50 (14.04%) 


U-U/G-U/WC 


07 (03.91%) 


WC/G-U/C-A 


02 (06.06%) 


WC/G-U/AU 


25 (07.02%) 


G-G/G-U/WC 


05 (02.79%) 


WC/G-U/A-A 


02 (06.06%) 


WC/G-U/UA 


22 (06.17%) 


A-A/G-UAA/C 


05 (02.79%) 










G-A/G-UAA/C 


02 (01.11%) 







Occurrences of nWC pairs preceding wobble G-U are comparatively more than those that succeed it. Note 5' nWC/G-U 3' is equivalent to 5' 
U-G/nWC 3'. 
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TABLE 4. Base-pair nonisostericity matrix for f/ans-H/SE base pairs 



\3' 
5' \^ 


WC 


GU 


UG 


/-H/SE 
A-U 


/-H/SE 

U-A 


t-WSE 
AC 


/-H/SE 

C-A 


t-WSE 

AG 


/-H/SE 

GA* 


/-H/SE 

UG 


/-H/SE 

GU* 


/-H/SE 

c-u 


/-H/SE 

U-C* 


/-H/SE 

c-c 


/-H/SE 

A-A 


/-H/SE 

GG 


WC 


0.0 


+ 14.0 


-11.0 


+47.2 


+38.9 


+50.6 


+48.5 


+44.8 


-45.6 


+46.1 


-44.5 


+52.3 


-53.8 


+49.7 


+51.2 


+17.0 


GU 


+0.05 




-24.9 


+33.2 


+24.9 


+36.7 


+32.4 


+30.9 


-59.5 


+32.1 


-58.5 


+38.3 


-67.8 


+35.7 
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Comparison of residual twists (At, green) and radial differences (Ar, brown) of f/ans-H/SE nWC pairs with WC, c/'s-G-U, and c/s-U-G base pairs 
(gray). Base steps with low residual twist are highlighted in light green. Note that not all base-pair combinations of this family are feasible. 
G-A*, G-U*, and U-C* represent f/ans-H/SE A-G, U-G, and GU base pairs, respectively, but in flipped orientation. Glycosidic bond angles 
and C1' ... C1' separations are as suggested by Leontis et al. (2002). The matrix is symmetric along the principal diagonal with opposite signs. 



consecutive G-U base pairs of four or more in a WC duplex 
suggest that loss of hydrogen bonds leads to a less stable na- 
ture of such duplexes (data not shown). 

A-G pair succeeding G-U 

Among the succeeding nWC pairs, A-G pairs occur frequently 
(27.2%). It is observed that the succeeding A-G base pairs in 
nearly every situation are paired in the ds-WC/WC scheme 
(Fig. 5C). A ds-WC/WC A-G base-pairing scheme at this 
step promotes better stacking (Fig. 5C) due to the low At 
(—6.2°) -mediated underwinding (Fig. 4H), compared to the 
large Af (+30.9°)-mediated poor stacking interactions of a 
rrans-H/SE scheme. Thus, the observed base-pairing schemes 
for the preceding and succeeding A-G pairs (Fig. 5A,C) induce 
negative At with the wobble G-U pair enhancing base stacking 



and stability, providing a rationale for the observed juxtaposi- 
tions of A-G and G-U wobble pairs. 



G-A pair succeeding G-U 

G-A pairs succeeding a G-U wobble pair are equally observed 
(27.2%). GA pairs, like A-G pairs, exhibit variations in base- 
pairing schemes. It is seen that succeeding G-A pairs within 
the helical stem are paired via a ds-WC/WC scheme, while 
those at the helical termini closing loops are paired in a 
/rans-H/SE scheme. 

ds-WC/WC G-A pairs are accommodated within the helix 
without causing significant changes to the duplex structure 
(Leonard et al. 1994). It promotes better intra-strand base 
stacking with the preceding wobble G-U (Fig. 5D) due to the 
low At (— 5. 4°) -mediated underwinding (Fig. 41). 
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FIGURE 4. Structural context of At. Predicted residual twist angle between various base-pair 
steps (A-/). +Af suggests overwinding, accompanied almost always by diminished intra-strand 
stacking and vice versa. nWC base pairs paired via tmns-WSE are italicized. 



faces toward the helix (Gautheret et al. 
1995) and, as such, has the lowest thermo- 
dynamic stability (He et al. 1 99 1 ) , explain- 
ing the extremely low occurrence of this 
motif. 

Apart from the above-discussed nWC 
base pairs, A-C, CU, U-U, A-A, G-G 
and A-C, U-U, C-A, A-A are the other 
nWC pairs found preceding or suc- 
ceeding G-U wobble pairs, respectively. 
They occur in fewer numbers and do 
not show any regular pattern. 

G-U AND UG WOBBLE 
PAIRS IN PROTEIN-RNA 
INTERACTIONS 



On the other hand, presence of trans- H/SE pairs within the 
helical region are expected to significantly alter the regular 
backbone geometry (Baeyens et al. 1996) and hence would 
be preferred at helical breakpoints or at helical termini closing 
loops, wherein such G-A pairs, predominantly, stack well 
(Burkard et al. 1999). Most interestingly, G-U wobble pairs 
preceding such G-A pairs assume a hydrogen bonding scheme 
similar to a trans-H/SB UG pair (Leontis et al. 2002), with a 
single hydrogen bond between N2(G) ... 04(U) concomitant 
with a larger CI' ... Cl'separation(11.2A).TheresultingG-U 
wobble subtends a low At (—1.3°) with the succeeding G-A 
(Fig. 4J) enhancing intra-strand base- 
stacking interactions (Fig. 5E). 



U-G pair succeeding G-U 

A U-G wobble pair succeeding a G-U pair 
is less frequently observed (15.1%). The 
G-U/U-G motif is less preferred due to un- 
favorable steric contacts within it and the 
sugar-phosphate backbone which may 
severely influence the regular A duplex ge- 
ometry (Gautheret et al. 1995). As dis- 
cussed earlier, a negative At (—24.9°) at 
this base step promotes better intra-strand 
base stacking. However, a positive Af (14° 
and 11°) prevails between a WC pair pre- 
ceding and succeeding wobble G-U and 
U-G pairs, respectively (Fig. 4D). As reit- 
erated several times in this article, they 
induce overwinding, causing poor inter- 
strand stacking interactions with the 
flanking WC base pairs (Biswas and 
Sundaralingam 1997). The G-U/U-G mo- 
tif thus exposes the overwound destacked 



Base-pair geometry of wobble pairs cre- 
ates unique features on the structure and 
electrostatic surface potential landscape 
of the duplex, offering itself for metal ion binding and protein 
recognition interactions. The local electrostatics of the major 
groove are altered when the free carbonyl group of uracil (04) 
replaces the invariant amino group of G.C (N4) or A.U (N6) 
WC pairs, thus causing localized regions of deep negative elec- 
trostatic potential (Varani and McClain 2000; Xu et al. 2007). 
The metal ion binding at these sites is an important require- 
ment for substrate recognition by group I intron ribozymes 
(AUain and Varani 1995). Although the localized negative 
electrostatic surface potential at the deep major groove allows 
binding of metal ions or water, its narrow groove width could 




5'G-U/G-A3' 



5' G-U'G-A 3' 



FIGURE 5. Nature of stacking interactions at the interface of wobble G-U and G-A base pairs. 
Stacking of nWC G-A and A-G base pairs with wobble G-U in different structural and hydrogen 
bonding contexts. Carbon atoms of A-G and G-A base pairs are colored gray, while those of 
G-U pairs are colored white. (A) A frcms-H/SE A-G pair preceding G-U wobble exhibits better 
intra-strand base stacking due to low Af (—30.8°). (B) G-A pair preceding G-U wobble results in 
inter-strand base stacking due to large At (+59.5°)-mediated overwinding. (C) A ds-WC/WC 
A-G base pair succeeding a G-U wobble manifests in better intra-strand base stacking due to 
low At (—6.2°). (D) A ds-WC/WC G-A pair succeeding a G-U wobble and occurring within 
the helical stem (right panel) (pdb: lfjg). (E) G-A paired in frans-H/SE scheme and occurring 
at the helical termini (right panel) (pdb: lfjg). Both these pairing schemes facilitate good intra- 
strand stacking as a result of low At (—5.4° and —1.3°) subtended with the succeeding G-U 
wobble. 
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hinder interactions with proteins. Hence, most protein-RNA 
interactions are confined along the minor groove (Hermann 
and Westhof 1999). 

Protein contacts are mediated by the exocyclic N2 
amino group of wobble G-U 

Any feature that breaks the monotony of the direct read-out of 
WC pairs along its minor groove side could serve as a unique 
site for recognition interaction. Occurrence of a nWC pair 
amid WC pairs alters the local geometry of the duplex and pre- 
sents distinctive donor/acceptor groups for interactions. 
Presence of nWC pairs such as a G-U or UG wobble not 
only influence the minor groove surface, due to Af-mediated 
mechanistic effects but also exposes the unpaired exocyclic 
amino group. The significance of these wobble pairs is under- 
scored by their involvement in a large number of protein-RNA 
interactions. 

A G-U wobble pair in the acceptor stem of Escherichia coli 
tRNA AIa (Hou and Schimmel 1988; McClain and Foss 1988) 
and in the D stem of yeast tRNA Asp (Piitz et al. 1991) is found 
to be crucial for recognition and aminoacylation by their cog- 
nate synthetases. Similarly a conserved G-U wobble pair is an 
important requirement for splicing activity of ribozymes 
(Peebles et al. 1995; Strobel and Cech 1995). Replacement of 
a G-U wobble by a nonisosteric G.C or U-G pair retains the 
N2 functional group, but such nonisosteric substitutions, nev- 
ertheless, impede recognition and aminoacylation of tRNA^ 3 
(Hou and Schimmel 1988; Park et al. 1989). On the other 
hand, substitution of base pairs isosteric to the G-U wobble, 
such as A-C and I-U, which lack the N2 exocyclic amino group, 
is shown to result in comparatively weaker interactions at the 
catalytic core of Tetrahymena ribozyme (Strobel and Cech 
1995). However, base-pair substitutions which are both isos- 
teric and comprise the N2 amino group restore the catalytic 
activity (Strobel and Cech 1995), suggesting the role of both 
base-pair isostericity and the determinant functional group 
in interaction and catalysis. 

Protein-RNA interactions involving the wobble base pairs, 
G-U and U-G, are, by and large, mediated by hydrogen bond 
interactions between the exocyclic N2 amino group and side 
chains of Gin, Asp, Glu, Asn, and His. These interactions gen- 
erally arise from the helical regions of protein (helix-loop- 
helix/|3-a-(3 motifs) and reside along the minor groove of 
the RNA duplex. G-U and U-G wobble pairs present within 
the helical stems as well as at the termini participate equally. 
Although a majority of these minor groove interactions engage 
the N2 amino group, a few instances involving N3, 02, and 2'- 
OH are also found. Likewise, interactions involving peptide 
carbonyl groups are seen occasionally. 

An example of such interaction with N3 (G) is observed in 
the 50S ribosomal subunit of E. coli (pdb: 3r8s). The loop re- 
gion of ribosomal protein LI 8 positioned along the minor 
groove side of the Gj-Um wobble facilitates hydrogen bond 
interactions with Ser45 (Oy) and G9 (N3). Gly44, adjacent 



to the interacting serine, forms a hydrogen bond involving 
peptide carbonyl with 2'-OH of G9. Ser214 from the 
Haloarcula marismortui large ribosomal protein L2P (pdb: 
ls72) also participates in a similar main chain hydrogen- 
bonded interaction with 2'-OH of G 1898 which pairs with 
U1939. A rare protein-nucleic acid interaction involving methi- 
onine is observed in the H. marismortui large ribosomal sub- 
unit (pdb: ls72). Met23 of the L30P subunit interacts with 
U942G1024 of 23S rRNA through main-chain and side-chain 
groups. Flipping of the unpaired A 943 succeeding the 
U942G1024 pair positions Met23 into the minor groove, facil- 
itating the main-chain carbonyl and S5 in forming hydrogen 
bonds with 2'-OH (G 10 24) and N2, respectively. 

G-U wobble pairs accommodate eight conserved hydra- 
tion sites, five at the major groove and three at the minor 
groove (Auffinger and Westhof 1998). Among them, three 
(two major groove and one minor groove) hydration sites 
were proposed as integral elements of the tRNA Ala acceptor 
stem and implicated in synthetase-tRNA interaction (Mueller 
et al. 1999). The invariant water molecule at the minor 
groove that forms a hydrogen bond network bridging N2 
(G), 02 (U), and 2'-OH (U) atoms of the wobble pair 
is displaced upon protein-mediated interaction with the 
G 38 U 45 3 wobble pair in the Deinococcus radiodurans large 
ribosomal subunit (pdb: 2zjr). Here, Arg40 of the ribosomal 
protein L4 interacts with G 38 U4 5 3 of 23S rRNA by form- 
ing hydrogen bonds between NH1, NH2 (Arg40), and 02 
(U453) (Fig- 6A). 

Wobble pair-mediated kink in a helix 

Interactions of RNA with proteins or RNA are known to bring 
about conformational changes both in protein and RNA 
(Frankel and Smith 1998; Ellis and Jones 2008). Instances of 
conformational change in proteins, seemingly influenced by 
wobble pairs, are noticed in the large subunit of bacterial ribo- 
somes. A distinct bend (~25°) is observed in helix 3 of E. coli 
50S ribosomal protein L20 interacting with 23S rRNA (pdb: 
3r8s) . Interestingly, such a bend of nearly the same magnitude 
is also found in the ribosomal protein of Thermus thermophilic 
(pdb: 2wdl) and D. radiodurans (pdb: 2zjr). The site of the 
bend, in all of these, entices a number of interactions (Fig. 
6E) linking the wobble pairs of the RNA duplex and the pro- 
tein. The tandem wobble pairs U 5 34-G55 9 /G535-U 5 5 8 of the E. 
coli 23S rRNA interact with Asp48, Gln51, and Gln55 along 
the minor groove face of the RNA duplex. Asp48 and Gln5 1 
form hydrogen bond interactions with N2 amino and 02' 
hydroxyl groups of the U 5 34-G 5 59 wobble, while Gln55 makes 
contact with the N2 amino group of the succeeding G 5 3 5 -U5 58 
wobble. Apart from hydrogen bond interactions with the 
wobble pairs, Gln51 and Gn55 are engaged in side chain- 
side chain interactions. A conserved basic residue Arg52 aris- 
ing from the same helix makes multiple hydrogen bond inter- 
actions with the phosphate backbone atoms of the wobble 
G535-U 558 and the succeeding WC pair. 
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FIGURE 6. G-U wobble pair-mediated protein-RNA interactions. (A) Interaction between argi- 
nine of L4 protein and 02-carbonyl (U 453 ) of wobble G 38 -U 4 53 of Deinococcus radiodurans 23S 
rRNA along the minor groove. Interaction along the major groove involving (B) Arg2 from the 
Haloarcula marismortui ribosomal protein L2P and the U 8 7 2 G 8 7 8 wobble pair of 23S rRNA, 
(C) His64 from the mouse NF-kB p50 subunit and the Ui 3 -G 18 wobble pair, (D) Lys55 from 
the Thermus thermophihis ribosomal protein Sll and the G 69 o-U 69 7 wobble pair of 16S rRNA. 
(£) Extensive hydrogen bond interactions between polar residues of the Escherichia coli L20 ribo- 
somal protein with tandem ^34-6559/6535-^58 pairs of 23S rRNA. Note the sharp bend in helix 
a3 at the interaction loci involving wobble pairs. 



The H. marismortui large ribosomal 
protein L2P (pdb: ls72) provides an ex- 
ample of the major groove interaction in- 
volving both 06 and 04 acceptors. 
Helical uncoupling at the loop in 23S 
rRNA exposes the terminal U 87 2-G 878 
wobble pair for interaction with Arg2 of 
ribosomal protein L2P. The succeeding 
G 873 -A 876 pair and the flipped out G 877 fa- 
cilitates Arg2 to access the wobble pair 
through the major groove and forms hy- 
drogen bonds with NH2 . . . U 872 (04) 
and G 878 (06) (Fig. 6B). The mouse nu- 
clear factor NF-kB p50 subunit (pdb: 
looa) recognizes the distorted major 
groove surface of the RNA aptamer and 
makes conserved interactions with the 
Ui 3 -Gi 8 wobble pair (Huang et al. 2003). 
His64 (ND1) from the LI loop region of 
NF-kB p50 subunit interacts with G l8 
(N7 and 06) in ways identical to the con- 
tacts (Fig. 6C) made with its DNA sub- 
strate (Huang et al. 2003). Interactions 
involving a noncanonical G-U base pair 
at the major groove interaction site is ob- 
served in the T. thermophilics 16S rRNA 
(pdb: lfjg). A stretched G 690 -\J 697 pair 
(tams-H/SE U-G) with the CI'... CI' 
separation of 1 1 .4 A and a single hydrogen 
bond (N2-H . . . 04) interacts with Lys55 
of ribosomal protein Sll involving N7 
and 06 of G 690 (Fig. 6D). 

Analyses indicate that At- induced local 
helical twist variations are not altered 
even when wobble pairs are involved in 
interaction with proteins. Analyses of 
CI' ... CI' distance and helical twist an- 
gles show minor changes in the helical 
twist angles (±2°) and CI'... CI' dis- 
tances (-0.15 A). 



Wobble-mediated interactions 
at the major groove 

Although the narrow nature of major groove width of RNA 
duplexes limits its interaction with proteins, uncoupling of 
helices at large internal loops (Weeks and Crothers 1993), to- 
gether with groove width variations, due to the occurrence of 
nWC base pairs (Hermann and Westhof 1999), facilitates its 
accessibility for protein recognition interaction. Presence of 
potential hydrogen bond acceptors 06, N7, and 04 along 
the major groove of the wobble pair suggests interactions 
mostly involving positively charged amino acid residues. A 
few such instances of major groove interactions involving 
G-U wobble pairs are observed, primarily mediated through 
hydrogen bonds with Arg, His, and Lys. 



G-U WOBBLE IN LONG-RANGE INTERACTIONS 

Apart from their role as potential sites for protein binding, 
nWC pairs play a role in the compact folding of large mole- 
cules such as ribosomes (Nissen et al. 2001). They participate 
in packing of helices through long-range interactions in mo- 
tifs such as kink-turn (Klein et al. 2001), along-groove pack- 
ing (Gagnon and Steinberg 2002), and the A-wedge motif 
(Gagnon and Steinberg 2010). 

G-U wobble-mediated tertiary interactions are of critical 
importance in the "along-groove packing motif (Gagnon 
and Steinberg 2002). Here, a "central" G-U wobble pair is en- 
gaged in long-range interactions with a WC base pair along the 
interface formed by the minor grooves of the two interacting 
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helices. The nonisosteric nature of the G-U pair compared to a 
WC pair causes U of the G-U pair to move away (due to resid- 
ual twist) from the minor groove, creating enough room to fa- 
cilitate close interaction with the WC pair (G.C) of the other 
helix (Gagnon et al. 2006). This promotes tight packing of 
the two helices mediated by hydrogen bond interactions 
(Gagnon and Steinberg 2002; Gagnon et al. 2010), as shown 
in Figure 7 A. Mutation of the G-U wobble to WC or to other 
nWC pairs proves detrimental for helical packing, resulting 
in ribosomal inefficiency (Gagnon et al. 2006). Replacement 
of G-U by a U-G pair would not only alter the minor groove 
surface complementarity (G-U # U-G) but also lead to steric 
clashes at the interface ( Gagnon et al. 20 1 0 ) , weakening the he- 
lical packing. Likewise, helical packing would be destabilized 
when the groove packing motif comprises two central WC 
base pairs (Gagnon and Steinberg 2002). Replacement of 
G-U wobble by G.C results in a significant change in the orien- 
tation of ribose sugars of the "internal" bases, leading to a loss 
in minor groove interactions (Gagnon et al. 2010). Also, dif- 
ferences in base-pair geometry of G.C compared to wobble 
G-U results in the loss of hydrogen bond interactions of the 
N2 amino group with the ribose sugar of the base of the facing 
helix, creating an opening or "crack" (Fig. 7B) at the interface 
(Gagnon et al. 2006, 2010). It is further observed that prefer- 
ence for G.C or C.G as "central" pairs over A.U or U.A 
(Gagnon et al. 2006) reflects the need for maximizing long- 
range interactions for packing stability, provided by additional 
interaction involving the exocyclic N2 amino group of gua- 
nine vis-a-vis adenine. However, swapping of G-U and WC 




FIGURE 7. G-U wobble in long-range interactions. Along-groove pack- 
ing in RNA mediated through extensive hydrogen bond interactions in- 
volving internal (pink) and external (blue) base pairs of the participating 
helices along the minor groove. (A) Interactions involving U 62 -Gi 0 5 and 
G384.C379 base pairs of T. thermophilus 16S rRNA (pdb: lfig). Presence 
of a wobble pair leads to a complementary fit between two strands re- 
sulting in close helical packing mediated by base-sugar interactions. 
(B) Comparatively weak interaction involving C549.G35 ... G544.C501 
base pairs (pdb: lfig) due to nonavailability of the N2 amino group of 
G35 for interaction with the ribose sugar, leading to an opening or 
"crack" at the interface. 



pairs of the two interacting helices does not affect helical pack- 
ing, due to the complementary nature of the interaction 
(Gagnon et al. 2006). The critical nature of the presence of 
the G-U wobble pair in defining the interactions responsible 
for "along-groove packing motif is evident. 

DISCUSSION 

Nucleic acids owe their remarkable structures to the underly- 
ing geometries of their constituent base pairs. The unique WC 
base-pairing scheme endows DNA with both sense and struc- 
tural complementarity, imperative for structural uniformity 
and faithful replication. Besides the geometrical equivalence 
(isostericity) of WC pairs, the nearly identical glycosidic 
bond angles allow interchangeability of bases within them. 
These render WC pairs to be not only isosteric but also self- 
isosteric, facilitating substitution of any one of the four base 
pairs (A.T, G.C, T.A, and C.G) in a duplex without signifi- 
cantly altering the duplex structure. Other possible base pairs 
which do not conform to these are said to be nonisosteric with 
WC pairs. Such nonisosteric base pairs are found in fairly large 
numbers in RNA, although WC pairs dominate the arena, un- 
derscoring the profound importance of base-pair isostericity 
and base-pair self-isostericity in governing the structure and 
function of nucleic acids in general. It is, therefore, of utmost 
importance to understand and obtain an estimate of the de- 
gree of nonisostericity of nWC base pairs compared to the pre- 
ponderant WC pairs. It is recognized that the presence of nWC 
pairs amid WC pairs influences the structure of nucleic acid 
duplexes, and nonequivalent base-pair substitutions are 
found to eliminate function (Chang et al. 1999; Zhong et al. 
2006), thus playing a major role in the evolution of macromol- 
ecules (Gutell et al. 1994). However, rationale for the observed 
nWC base pair-mediated mechanistic effects in RNA duplexes 
still remains vague. In this context, we have attempted to 
define and quantify base-pair nonisostericity between any 
two base pairs (WC/WC or WC/nWC or nWC/nWC) by 
means of a simple and discernible metric (Ar and Ar) that 
also enables visualization of their mechanistic effects in a 
straightforward manner. 

Utility of such a metric is illustrated by taking the example 
of the wobble G-U, which has attracted considerable atten- 
tion due to its abundance and implications in biological func- 
tions. Its juxtaposition with a WC pair imparts an intrinsic 
residual twist (At) of 14° even in the absence of the prescribed 
helical twist of 33°. Its influence on the helical twist angles is 
manifested as local helical overwinding and underwinding, 
depending on whether the wobble G-U succeeds (+Ar) or 
precedes (—At) a WC pair, respectively. This observation is 
consistent with X-ray crystal structure and MD simulation 
data. Origin of such observations seen in RNA duplexes in- 
terrupted by a U-G pair, as well as GU and UG dinucleotides, 
can be readily linked to the effect of magnitude and sign of Ar. 

It is seen that an insertion of a G-U wobble (or any nWC 
pair) amid a RNA duplex comprising WC base pairs creates 
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a break in the phosphodiester link due to At and consequent 
local twist angle variations (over- and underwinding). 
However, the sugar-phosphate backbone flexibility absorbs 
the conformational changes required to re-establish the back- 
bone linkage and underplays the effects of Af, leading to ob- 
served twist angle variations much less than that anticipated 
from At (Fig. 3). Interestingly, data from MD simulations 
(present study) and a few X-ray crystal structures indicate pos- 
sible changes in the P - 05' (a) and C5' - C4' (y) torsion angles 
from gauche~ to gauche' /trans and from gauche + to trans, re- 
spectively, either in the nucleotides associated with the nWC 
base pair or in the nucleotides immediately flanking them 
(Fig. 8). The possible direct or indirect significance of these 
as a structural or functional requirement is not clear. 

Earlier attempts at quantifying base-pair nonisostericity 
(Walberer et al. 2003; Stombaugh et al. 2009) have been 
used to infer base-pair conformations of nWC pairs 
(Gautheret and Gutell 1997; Walberer et al. 2003; Mokdad 
and Frankel 2008), RNA sequence (Stombaugh et al. 2011), 
and structure alignments (Mokdad and Leontis 2006) in iden- 
tification (Sarver et al. 2008; Zhong et al. 2010) and clustering 



(Zhong and Zhang 2012) of RNA structural motifs. Base-pair 
isostericity characterized based on the differences in CI' ... 
CI' separations (Ac) (Stombaugh et al. 2009) can be related 
to Ar by a factor of two (half of the difference in the intra- 
base pair CI' ... CI' distance). Relative displacements of CI' 
atoms (fj) (Stombaugh et al. 2009) and shear (Olson et al. 
2001) are relatable to residual twist {At). Both of these are 
concurrently accounted for in the circular representation. 
For instance, the isodiscrepancy index of G.C/G-U (2.14) 
andU-G/G-U (4.48) pairs (Stombaugh et al. 2009) is roughly 
proportional to the magnitude of Af at G.C/G-U (+14.0°) and 
U-G/G-U (+24.9°). Though correlation between isodiscre- 
pancy index and At seemingly draws parallels between the 
two metric schemes, such straightforward relation may not 
hold true for other base-pair combinations wherein both At 
and Ar influence nonisostericity. For example, the isodiscre- 
pancy index of G-A/G.C (3.49) and C-U/GC (3.44) 
(Stombaugh et al. 2009) does not allow invisioning its effect 
either through local (helical twist angle variation) or global 
(kink or bend) conformation of RNA structure. It merely 
gives a measure to comprehend the extent of nonisostericity 
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FIGURE 8. Correlated backbone conformational variations. Scatter plots depicting conformational changes in the backbone torsion angles, a (P - 
05') and y (C5' - C4') in nucleotides associated with wobble base pairs (A,B) and in the nucleotides immediately flanking them (C,D). Left (A,C) and 
right (B,D) panels correspond to data obtained from crystal and MD simulation structures, respectively. 
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between two base pairs. In contrast, the distinct measures of 
base-pair nonisostericity metrics, At and Ar, directly relate 
the cause and effect, providing readily discernible means to 
base-pair nonisostericity-mediated mechanistic effects. It is 
noteworthy that nWC base pairs such as C-U and G-A display 
intrinsic residual twists of Ar» —8.3° and +7.8° respectively, 
with WC base pairs despite the absence of shear (Fig. 9), 
further highlighting the usefulness of describing base-pair 
nonisostericity through circular representation. This seems 
natural since a circular strip is part of a cylindrical helix. 

The base-pair nonisostericity metric discussed here is appli- 
cable in principle to all ris-nWC base-pair combinations 
(Table 1). It is seen that certain nWC 
base pairs such as Uh-U L , and C-C + exhibit 
a large At with the WC pair, while similar 
large values of Af are also observed among 
nWC pairs such as GG7G-U, U h -U L /G-A, 
A L -A h /Uh-U L , C-C + /A L -A h . Likewise, pro- 
nounced Ar values are observed for nWC 
pairs such as G-A, A-A, U-U, C-C + , U-C 
with respect to the WC pair (Table 1). 
Such large values of A rand Ar are expected 
to significantly affect RNA conforma- 
tions. These might be relevant for protein 
recognition interactions, as nWC pairs 
like U-U, U-C, C-C, A-A are involved in 
trinucleotide repeat expansion diseases 
(Gatchel and Zoghbi 2005) such as myo- 
tonic dystrophy (Kumar et al. 201 1), frag- 
ile X syndrome E (Kiliszek et al. 2012), 
Huntington's disease, dentatorubral-pal- 
idoluysian atrophy, and spinocerebellar 
ataxias (Kiliszek et al. 2010). Besides Ar, 
one might also expect Ar to play a sig- 
nificant role on the global conformation 
of RNA duplexes comprising purine- 
purine or pyrimidine-pyrimidine pairs. 
Thus, Arand Ar would aid in understand- 
ing sequence-dependent RNA confor- 
mations arising out of nonisostericity of 
base pairs. 

The scheme is readily extendable to 
study the effects of nWC base pairs in 
DNA duplexes such as G-T wobble and 
can also be adopted for description of 
base-pair nonisostericity among trans- 
H/SE base pairs (Table 4). Efficacy of 
the concept of residual twist in predicting 
mechanistic effects of base triplet noni- 
sostericity in DNA triplexes has been 
demonstrated (Thenmalarchelvi and 
Yathindra 2005, 2006) and has been ap- 
plied in developing an algorithm to iden- 
tify triplex- forming sequences (Lexa et al. 
2011). It is envisaged that the knowledge 



of residual twist and radial difference can be effectively uti- 
lized for a comprehensive understanding of the structural in- 
fluence of base-pair nonisostericity in nucleic acids duplexes/ 
triplexes. 

MATERIALS AND METHODS 

Modeling of Watson-Crick and non-Watson-Crick 
base pairs 

Watson-Crick and non-Watson-Crick cis base pairs are modeled us- 
ing standard hydrogen bond length and angles and paired via 




GC/UL-Uh 



GCOC 



GC/AL-Ah 




GC/GA + 



GC/U-C 



GC/OU 




GC/Uh-UL 



GC/GC+ 



GC/Ah-AL 



FIGURE 9. Visualization of nonisostericity between WC and nWC base pairs. Superposition of 
various nWC base pairs (brown) on a G.C base pair (representative of WC pairs; green). Base 
pairs are encompassed by circles with the respective CI' ... CI' separation as diameters. +At in 
A-F and —At in G—L suggest local overwinding and underwinding, respectively. 
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FIGURE 10. ris-WC and -nWC base pairs. CI' ... CI' separations (d) and 
(X) are used to describe various cis- WC and ds-nWC pairs (Leontis et al. 
with high \-value are designated Aj, and Uh, respectively. 



schemes suggested by Leontis et al. (2002). Glycosidic bond angles 
(\) and the CI' ... CI' separations (d) of all modeled base pairs 
are shown in Figure 10. Base pairs are modeled using the graphical 
interface of Insight II (Accelrys Inc.). 



Base-pair superposition 

Base-pair superposition is achieved by aligning CI', N9, C8, C4/C1', 
Nl, C6, C2 atoms of purines/pyrimidines using the graphical inter- 
face of Insight II (Accelrys Inc.). 



Definition of residual twist (At) and radial 
difference (Ar) 

Residual twist (At) is calculated by measuring the angle between the 
line joining CI' ... CI' atoms of the superimposed base pairs when 
the helical twist angle (f) required to generate a helical structure is 
not effected, i.e., when t = 0. The residual twist angle subtended 
by the 3' base pair in the counter-clockwise direction with respect 
to the 5' base pair is assigned a positive value and likewise, the resid- 
ual twist angle subtended by the 3' base pair in the clockwise direc- 
tion with respect to the 5' base pair is assigned a negative value. The 
radial difference (Ar) corresponds to half of the difference between 
the CI' ... CI' separations (diameter) of the superposed base pairs. 
Ar is assigned a positive sign if the radius of the 3' base pair is greater 
than the radius of the 5' base pair and negative sign if the radius of 
the 3' base pair is less than the radius of the 5' base pair. 



glycosidic bond angles 
2002). Bases A and U 



(Berman et al. 2000). Instances of helix-loop 
regions from crystal structures were obtained 
from RNA FRABASE 2.0 (Popenda et al. 

2010) . Crystal structure data on protein- 
RNA interaction involving G-U wobble pairs 
were obtained from the Protein-RNA In- 
terface Database (PRIDB) v 1.0 (Lewis et al. 

2011) and manually curated. 

MD simulation 

RNA duplex models comprising wobble pair 
G-U, U-G (9-mers) and tandem repeats 
G-U/U-G, U-G/G-U, and G-U/G-U (10-mers) 
are built using the Discovery studio 2.5 pack- 
age (Accelrys Inc.) and optimized using the 
steepest descent algorithm embedded in the 
package. They are used as the starting model 
for MD simulations. Using the LEaP module 
of AMBER 11, RNA duplexes are solvated in 
a periodic box of TIP3P waters, and neutraliz- 
ing Na+ counter ions were added. The equil- 
ibration and production runs are pursued 
following the protocols described in http 
://ambermd.org/tutorials/basic/tutoriall. Production runs are car- 
ried out using the Sander module of AMBER1 1 for 20 nsec on a 
high-performance computing cluster (dual hexacore, seven nodes). 
Simulation is performed under isobaric and isothermal conditions 
with SHAKE (tolerance = 0.0005 A) on the hydrogens (Ryckaert 
et al. 1977), with a 2-fsec integration time and a cut-off distance 
of 9 A for Lennard-Jones interaction. Trajectories are analyzed using 
the Ptraj module of AMBER11. 

Calculation of helical twist and backbone 
torsion angles 

Helical twist angle and backbone torsions were estimated using 
X3DNA (Lu and Olson 2003). Snapshots taken at every 200 psec 
of MD simulation (Schemes Al to A5) were used in the calculation. 
In conformity with the concept of residual twist, helical twist angles 
are calculated using CI' ... CI' vectors. 
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Crystal structure data mining 

Crystal structures of RNA duplexes used in the analysis are selected 
from a list of nonredundant RNA structures (resolution <3 A) from 
the RNA 3D Hub (http://rna.bgsu.edu/nrlist/; Feb. 2012 release). 3D 
coordinates are retrieved from NDB (Berman et al. 1992) and PDB 
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